Skip to main content

All Questions

3votes
1answer
34views

Name of algorithm that maps a string column to a float column, based on an aggregation with another float column , similar to TF-IDF

The Question I'm not super familiar with the name's of common algorithms in Data Science, and I feel like this would be something that is commonly used, and so should have a name - want to refer to ...
Mike Kennard's user avatar
1vote
1answer
202views

How to predict what someone will order?

Suppose Prof. X goes to a road side tea-coffee shop everyday at 5pm just after his office. After reaching there he tosses a coin, and places his order tea or coffee. The shop owner Y has been ...
S. M.'s user avatar
5votes
4answers
8kviews

What is the difference between AI, ML, NN and DL? [closed]

What is the difference between the following four categories: Artificial Intelligence (AI) Machine Learning (ML) Neural Network (NN) Deep Learning (DL) Data Science My current understanding is that ...
Capo Mestre's user avatar
1vote
2answers
707views

How do I select the "best" unsupervised machine learning algorithm to cluster my specific dataset?

I want to cluster a dataset without prior knowledge on the correct amount of clusters. For different algorithms (i.e. k-means, gmm...) I can iterate through different values and try to find the best ...
Alex's user avatar
0votes
1answer
76views

kMean clustering for recommendation

I have a file with 50000 rows from a library platform. Each individual row saves a user, and shows the order in which the user, has selected. The books could be from various categories (e.g. roman, ...
Natalia's user avatar
0votes
1answer
130views

Which algorithms should I use for identifying similar characteristics between data points (the intersections)?

I am working with a dataset that has been coded and categorized, so that each datapoint has a set of coded characteristics. An example data point would be something like the following: Example Data ...
Zigrivers's user avatar
1vote
1answer
33views

Need some advice on approach to select only the informative emojis from the data set?

I have a giant data set from a local elections, which contains hashtags, emojis, and comments. I wanted to make a network analysis using only emojis. So far I have a network analysis graph made in R ...
CaseebRamos's user avatar
0votes
1answer
72views

What is the output polytree after aplying the Ramex algorithm to this graph?

I've been trying to understand the way this algorithm works, but I can't get a consistent result. It has two phases: the first one coverts a table of events into a graph, and the second where the ...
Eduardo Ribeiro's user avatar
1vote
0answers
148views

Is the Apriori algorithm suitable for database tuples?

Problem description: I need to use an association rule algorithm that lets me use database tuples and I think Apriori is a good option, but I am not sure. Starting point: I know the Apriori ...
xvlaze's user avatar
2votes
1answer
164views

Efficient way to search list of items in a text document

I have a list of items (size ~50K) and several documents( average page per document ~10). I am trying to find what all items are listed in each document as follows : ...
Sreejithc321's user avatar
2votes
1answer
69views

Finding a Data pattern [closed]

I am new to this data science field. I have data of points in 3D space and each point "helps" a metric. I have the sets of points and corresponding metrics. Data might look like: ...
AGP's user avatar
  • 21
1vote
1answer
301views

Predict ratings for Item Based Collaborative Filtering

Given the (cosine) similarity score of top 100 neighbors of every item, how do I predict ratings for unrated items? Please explain in simple terms. Item 1 260 0.577305 780 0.5655413 1210 0....
saranyavsr's user avatar
1vote
0answers
505views

Pattern recognition in 2D dataset [closed]

I'm not familiar with the ways and tools of data scientists but I have a background in software engineering and mathematics. I'd need to recognize patterns in a two-dimensional data set. I have a lot ...
bencemeszaros's user avatar
1vote
2answers
632views

Shape of a distribution as a feature

How can I use the shape of a distribution as a feature in machine learning ? Do I use something like the standard deviation ?
DPascal's user avatar
1vote
1answer
39views

Implementation of reliable rule learning

I want to perform "reliable rule learning", i.e. mining a set of rules with a very low number of false negatives. I recently read the paper "Reliable agnostic learning" by Kalai et al. (https://doi....
Tobias B.'s user avatar

153050per page
close